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ABSTRACT 

The Abramov-Petkovsek reduction computes an additive 
decomposition of a hypergeometric term, which extends 
the functionality of the Gosper algorithm for indefinite 
hypergeometric summation. We modify the Abramov- 
Petkovsek reduction so as to decompose a hypergeometric 
term as the sum of a summable term and a non-summable 
one. The outputs of the Abramov-Petkovsek reduction and 
our modified version share the same required properties. 
The modified reduction does not solve any auxiliary linear 
difference equation explicitly. It is also more efficient 
than the original reduction according to computational 
experiments. Based on this reduction, we design a new 
algorithm to compute minimal telescopers for bivariate 
hypergeometric terms. The new algorithm can avoid the 
costly computation of certificates. 
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1. INTRODUCTION 

Creative telescoping is a staple of symbolic summation. 
Its main use is to construct recurrence equations that have 
a prescribed definite sum among their solutions. By using 
other algorithms applicable to recurrence equations, it is 
then possible to derive interesting facts about the original 
definite sum, such as closed forms or asymptotic expansions. 

The computational problem of creative telescoping is to 
construct, for a given term polynomials £o,...,£ r 

in x only, not all zero, and another term g(x,y) s.t. 

lo{x)f(x, y )-1-1- £ r (x)f(x + r,y) = g(x, y + 1) - g(x, y). 

The number r may or may not be part of the input. 

We can distinguish four generations of creative telescoping 
algorithms. The first generation was based on elimination 
techniques [15, 22, 19, 14]. The second generation started 
with what is now known as Zeilberger’s algorithm [21, 
5, 23, 19]. The algorithms of this generation use the 
idea of augmenting an algorithm for indefinite summation 
(or integration) by additional parameters £o,...,£ r that 
are carried along during the calculation and are finally 
instantiated, if at all possible, such as to ensure the existence 
of a term g as needed for the right-hand side. See [19] for 
details about the first two generations. 

The third generation was initiated by Apagodu and Zeil- 
berger [17, 6]. In a sense, they applied a second-generation 
algorithm by hand to a generic input and worked out the 
resulting linear system of equations for the parameters £i and 
the coefficients inside the desired term g. Their algorithm 
then merely consists in solving this system. This approach 
is interesting not only because it is easier to implement 
and tends to run faster than earlier algorithms, but also 
because it is easy to analyze. In fact the analysis of 
algorithms from this family gives rise to the best output 
size estimates for creative telescoping known so far [11, 12, 
13]. A disadvantage is that these algorithms may not always 
find the smallest possible output. 

The fourth generation of creative telescoping algorithms 
originates from [7]. The basic idea behind these algorithms 
is to bring each term f{x + i, y) of the left-hand side 
into some kind of normal forms modulo all terms that are 
differences of other terms. Then to find £g,... ,£ r amounts 
to finding a linear dependence among these normal forms. 
The key advantage of this approach is that it separates the 
computation of the £i from the computation of g. This 
is desirable in the typical situation where we are only 



interested in the ii and their size is much smaller than 
the size of g. With previous algorithms there was no way 
to obtain A without also computing g, but with fourth 
generation algorithms there is. So far this approach has 
only been worked out for several instances in the differential 
case [7, 9, 8]. The goal of the present paper is to give a 
fourth-generation algorithm for the discrete case, namely for 
the classical setting of hypergeometric telescoping. 

Our starting point is the Abramov-Petkovsek reduction 
for hypergeometric terms introduced in [3] and summarized 
in Section 3 below. Unfortunately the reduced forms 
obtained by this reduction are not sufficiently “normal” for 
our purpose. Therefore, in Sections 4 and 5 we present a 
refined variant of the reduction process and show that the 
corresponding normal forms are well-behaved with respect 
to taking linear combinations. Then in Section 6 we describe 
the creative telescoping algorithm obtained from this reduc¬ 
tion. The final section contains an experimental comparison 
between this algorithm and the built-in algorithm of Maple. 

2. PRELIMINARIES 

Throughout the paper, we let F be a field of characteristic 
zero, and F(y) be the field of rational functions in y over F. 
Let (j y be the automorphism that maps r(y) to r(y + 1) for 
every r G ¥(y). The pair (F(j/),<j !/ ) is called a difference 
field. A difference ring extension of (F (y),a y ) is a ring ID) 
containing F(y) together with a distinguished endomorphism 
<j y : D —> D whose restriction to ¥(y) agrees with the 
automorphism defined before. An element c G D is called a 
constant if a y (c) = c. For a nonzero polynomial p G F[y], 
its degree and leading coefficient are denoted by deg y (p) 
and lc^(p), respectively. 

Definition 2.1. Let HD be a difference ring extension of¥(y). 
A nonzero element T G D is called a hypergeometric term 
over F(y) if a(T) = rT for some r G F (y). We call r the 
shift quotient of T w.r.t. y. 

The product of hypergeometric terms is again hypergeo¬ 
metric. Two hypergeometric terms Ti,T 2 are called similar 
if there exists a rational function r G F(y) s.t. Ti = rT-j. By 
Proposition 5.6.2 in [19], the sum of similar hypergeometric 
terms is either hypergeometric or zero. 

A univariate hypergeometric term T is called hyperge¬ 
ometric summable if there exists another hypergeometric 
term G s.t. T = A y {G), where A y denotes the difference 
of <j y and the identity map. We abbreviate “hypergeometric 
summable” as “summable” in this paper. 

Given a hypergeometric term T, we let Ut be the union 
of {0} and the set of summable hypergeometric terms that 
are similar to T. Then Ut is an F-linear subspace of D. Note 
that Ut = U h if H is a hypergeometric term similar to T. 

Recall [3, §1] that a nonzero polynomial in F[y] is said to 
be shift-free if it is coprime with any of its nontrivial shifts. 
A nonzero rational function is said to be shift-reduced if its 
numerator is coprime with any shift of its denominator. 

A basic property of shift-reduced rational functions is 
given below. 

Lemma 2.2. Let f G F(y) be shift-reduced and unequal to 
one. If there exists r G F[y] s.t. fu y (r) — r = 0, then r = 0. 

Proof. Suppose that r ^ 0. Then / = r/a y (r). Since / is 
unequal to one, r does not belong to F. It follows that / is 
not shift-reduced, a contradiction. □ 


According to [3, 4], every hypergeometric term T has a 
multiplicative decomposition SH, where S is in ¥(y) and H 
is another hypergeometric term whose shift quotient is shift- 
reduced. We call the shift quotient K := a y (H)/H a kernel 
of T w.r.t. y and S the corresponding shell. Note that I\ = 1 
if and only if T is a rational function, which is then equal 
to cS for some element c G D with a y (c) = c. 

Let T = SH be a multiplicative decomposition, where S 
is a rational function and H a hypergeometric term with a 
kernel K. Assume that T = A y {G) for some hypergeometric 
term G. A straightforward calculation shows that G is 
similar to T. So there exists r G ¥(y) s.t. G = rH. One 
can easily verify that 

SH = A y (rH) <S=> S = Ka y (r) - r. (1) 

Let Vk = {Ka y (r) — r \ r G F(j/)}, which is an F-linear 
subspace of F(y). Then (1) translates into 

SH = 0 mod Vh S = 0 mod V*. (2) 

These congruences enable us to shorten expressions. 

3. ABRAMOV-PETKOVSEK REDUCTION 

Reduction algorithms have been developed for computing 
additive decompositions of rational functions [1], hyperexpo¬ 
nential functions [8], and hypergeometric terms [3, 4]. These 
algorithms can be viewed as generalizations of the Gosper 
algorithm [16, 19] and its differential analogue [5]. 

The Abramov-Petkovsek reduction [3, 4] is fundamental 
for this paper. To describe it concisely, we need a notational 
convention and a technical definition. 

Convention 3.1. Let T be a hypergeometric term whose 
kernel is K and the corresponding shell is S. Then T = SH, 
where H is a hypergeometric term whose shift quotient is K. 
Assume that K is unequal to one. Moreover, write K = u/v, 
where u,v are polynomials in F[y] with gcd(«, v) = 1. 

Definition 3.2. A nonzero polynomialp in¥[y\ is said to be 
strongly prime with K if gcd (p,<jf' l {u)') =gcd (p,a y (v)) =1 
for all i > 0. 

The proof of Lemma 3 in [3] contains a reduction algo¬ 
rithm whose inputs and outputs are given below. 

AbramovPetkovsekReduction: Given K and S as de¬ 
fined in Convention 3.1, compute a rational function SiGF(y) 
and polynomials 6, w G F[y] s.t. b is shift-free and strongly 
prime with K, and the following equation holds: 

S = Ka v (S 1 )-Si+ ™ ■ (3) 

b ■ Gy (u) ■ V 

The algorithm contained in the proof of Lemma 3 in [3] 
is described as pseudo code on page 4 of the same paper, in 
which the last ten lines are to make the denominator of the 
rational function V in its output minimal in some technical 
sense. We shall not execute these lines. Then the algorithm 
will compute two rational functions U\ and f/ 2 . They 
correspond to Si and w/ (b<j y 1 {u)v) in (3), respectively. 

We slightly modify the output of the Abramov-Petkovsek 
reduction. Note that K is shift-reduced and b is strongly 
prime with K. Thus, b, erf 1 (u ) and v are pairwise coprime. 
By partial fraction decomposition, (3) can be rewritten as 

S = AV a (S 1 )-5 1 +^ + ^U + ^V 
\b ay^u) vj 




where a,p\,p 2 G F[y], Furthermore, we set r = p\/a y 1 (u). 
A direct calculation yields r = Ka y (—r) — (—r) + a y (pi)/v. 
Update Si to be Si — r and set p to be cr^(pi) + P 2 - Then 

S = Ka y (S 1 )-S 1 + ^ + ^-. (4) 

b v 

This modification leads to shell reduction specified below. 

ShellReduction: Given K and S as defined in Con¬ 
vention 3.1, compute a rational function Si G F(y) and 
polynomials a,b,p G F[t/] s.t. b is shift-free and strongly 
prime with K, and that (4) holds. 

Shell reduction provides us with a necessary condition on 
summability. 

Proposition 3.3. With Convention 3.1, assume that a,b,p 
are polynomials in F[y] s.t. b is shift-free and strongly prime 
with K. Assume further that (4) holds. If T is summable, 
then a/b belongs to F[j/]. 

Proof. Recall that T = SH by Convention 3.1 and it has 
a kernel K and the corresponding shell S. It follows from 
(2) and (4) that T = (a/b + p/v) H mod Uh- Thus, T is 
summable if and only if ( a/b + p/v) H is summable. 

Set H' = (1 /v)H, which has a kernel K' = u/a y (v). Note 
that b is also strongly prime with K'. We can apply Theo¬ 
rem 11 in [4] to (av/b + p) H' , which equals (a/b + p/v) H. 
Thus, a/b is a polynomial because b is coprime with v. □ 

Example 3.4. Let T = y 2 y\/(y + 1). Then the term has 
a kernel K = y + 1 and the shell S = y 2 /(y + 1). Shell 
reduction yields S = — l/(y + 2) 4- y/v mod Vk where v=l. 
By Proposition 3.3, T is not summable. 

Note that a/b+p/v in (4) can be nonzero for a summable T. 

Example 3.5. Let T = y -y\ whose kernel is K = y + 1 and 
shell is S = y. Then S = y/v mod V k, where v — 1. But T 
is summable as it is equal to A y (y\). 

The above example illustrates that neither shell reduction 
nor the Abramov-Petkovsek reduction can decide summabil¬ 
ity directly. One way to proceed is to find a polynomial solu¬ 
tion of an auxiliary first-order linear difference equation [4]. 
We show how this can be avoided in the next section. 

4. MODIFICATIONS 

After the shell reduction described in (4), it remains to 
check the summability of (a/b + p/v) H. In the rational 
case, i.e. when the kernel K is one, a/b + p/v in (4) 
can be further reduced to a/b with deg y (a) < deg (6), 
because all polynomials are rational summable. However, 
a hypergeometric term with a polynomial shell is not 
necessarily summable, for example, y\ has a polynomial shell 
but it is not summable. 

We define the notion of discrete residual forms for rational 
functions, and present a discrete variant of the polynomial 
reduction for hyperexponential functions given in [8]. This 
variant not only leads to a direct way to decide summability, 
but also reduces the number of terms of p in (4). 

4.1 Discrete residual forms 

With Convention 3.1, we define an F-linear map </>k 
from F[y] to itself by sending p to ua y (p)—vp for all p G F[y]. 
We call (j>K the map for polynomial reduction w.r.t. K. 


Lemma 4.1. Let 

W k = span F |V | l G yf deg y (p) for all pGim (</>ic) j . 
Then F[j/] = im (4>k) © W k. 

Proof. By the definition of W k, im (4>k) n W k = {0}. The 
same definition also implies that, for every non-negative 
integer m, there exists a polynomial f m G im (c/k) U Wx 
s.t. the degree of f m is equal to m. The set {/o, /i, / 2 ,...} 
forms an F-basis of F[j/]. Thus F[j/] = im (4>k) © W k- □ 

In view of the above lemma, we call W k the standard 
complement ofim(cf>K). A polynomial p G F can be uniquely 
decomposed as p = pi + P 2 with pi G im (4>k) and p 2 G W k- 

Lemma 4.2. With Convention 3.1, let p be a polynomial 
in F[j/]. Then there exists q G W k s.t. p/v = q/v mod Vx- 

Proof. Let q be the projection of p on W k- Then there 
exists / inF[y] s.t. p = 4>K(f)+q , that is, p = ua y (f)—vf+q. 
So p/v = Ka y (f) — f + q/v, that is, p/v = q/v mod Yk- □ 

Remark 4.3. Replacing p in the above lemma by vp, we 
see that, for every polynomial p G F[y], there exists q G W k 
s.t. p = q/v mod Yk. 

By Lemma 4.2 and Remark 4.3, (4) implies that 

S = y + - mod Yk, (5) 

o v 

where a,b,q G F[j/], deg y (a) < deg y (&), b is shift-free and 
strongly prime with K, and q G W k- The congruence (5) 
motivates us to translate the notion of (continuous) residual 
forms in [8] into the discrete setting. 

Definition 4.4. With Convention 3.1, we further let f be 
a rational function in F(y). Another rational function r 
in F(y) is called a (discrete) residual form of f w.r.t. K 
if there exist a, b, q in F[j/] s.t. 

f = r mod Yk and r = — + —, 

b v 

where deg !/ (a) < deg y (fe), b is shift-free and strongly prime 
with K, and q belongs to W k ■ For brevity, we just say that r 
is a residual form w. r. t. K if f is clear from the context. 

Residual forms help us decide summability, as shown in 
the next proposition. 

Proposition 4.5. With Convention 3.1, we further assume 
that r is a nonzero residual form w.r.t. K. The hypergeo¬ 
metric term rH is not summable. 

Proof. Suppose that rH is summable. Let r = a/b + q/v, 
where deg y (a) < deg ;/ (6), b is shift-free and strongly prime 
with K , and q belongs to W k- By Proposition 3.3, a/b is a 
polynomial. Since deg y (a) < deg y (6), a = 0. Thus, (q/v)H 
is summable. It follows from (1) that there exists w in ¥(y) 
s.t. ua y (w) — vw = q. Thus, w G F[j/] by Theorem 5.2.1 
in [19, page 76]. So q belongs to im (4>k). But q also belongs 
to W k- By Lemma 4.1, q = 0, a contradiction. □ 

With Convention 3.1, let r be a residual form of the 
shell S. Then SH = rH mod Uh by (2) and (5). By 
Proposition 4.5, SH is summable if and only if r = 0. Thus, 
determining the summability of a hypergeometric term T 
amounts to computing a residual form of the corresponding 
shell w.r.t. a kernel of T, which is studied below. 


4.2 Polynomial reduction 

To compute a residual form of a rational function, we 
project a polynomial on im (4>k) and on its standard com¬ 
plement W k, both defined in the previous subsection. 

Let B k = {(t , K(y l ) \ i G N}. The F-linear map <j>K is 
injective by Lemma 2.2. So B k is an F-basis of im(</x), 
which allows us to construct an echelon basis. By an echelon 
basis, we mean an F-basis in which distinct elements have 
distinct degrees. We can easily project a polynomial using 
an echelon basis and linear elimination. 

To construct an echelon basis, we rewrite im (4>k) as 

im {(j> K ) = {uA y (p) - (v - u)p | p G F[p]} . 

Set qi = deg y (u),a 2 = deg y (v), and P = deg y (v - u). 
Moreover, set tk = lc y (v — n)/ lc y (it), which is nonzero due 
to Convention 3.1 and let p be a nonzero polynomial in F[j/]. 
We make the following case distinction. 

Case 1. P > a\. Then p = a 2 , and 

4>k(p) = — lc y (u — u) lc y (p)p“ 2+des * (p ) + lower terms. 

So Ik is an echelon basis of im {4>k), in which deg a ((/>rr(2/ 1 )) 
is equal to a 2 + i for all i G N. Accordingly, W k has an 
echelon basis {l, y,..., and dim(Wic) = a 2 . 

Case 2. p = a\. Then 

4>k(p ) = — lc y (u — u) lc y (p)y ai+dee v W + lower terms. 

So B k is an echelon basis of im {4>k), in which deg y (j/ 1 )) 

is equal to op + * for all i G N. Accordingly, W k has an 
echelon basis {l, y,..., and dim(Wic) = «i- 

Case 3. {3 < a i — 1. If deg y (p) = 0, then 4>k(p) = (u — v)p. 
Otherwise, we have 

4>k{p) = deg y (p) lc y (u) lc 3/ (p) 3 /“ 1+deg « (p) - 1 + lower terms. 

It follows that B k is an echelon basis of im(^>x), in 
which degj,(^>K(l)) = P and 

degj / (0ic(y 1 )) = Qi +* - 1 for all i > 1. 

So W k has an echelon basis {l,..., y^ +1 , ■ ■ ■, j/'* 1—1 }, 

and dim(Wrc) = Qi — 1. 

Case 4- P = an — 1 an d tk is not a positive integer. Then 

<Mp) = (deg y (p) lc„(u) - lc y (u - u)) \c y (p)y ai+des y < ' p) ~ 1 
+ lower terms. (6) 

So Bk is an echelon basis of im^x), in which, for all i G N, 
deg y (</>fsr(l/ 1 )) = cni + i — 1. Accordingly, W k is spanned by 
an echelon basis |1, y,..., y“ 1-2 }, and has dimension au — 1. 
Case 5. /3 = ai — 1 and tk is a positive integer. It follows 
from (6) that for i ^ tk , deg y (</>x(y 1 )) = ai + * — 1. 
Moreover, for every polynomial p of degree tk, 4>k(p) is of 
degree less than ai +tk — 1. So any echelon basis of im (4>k) 
does not contain a polynomial of degree au + tk — 1. Set 

= {</>*• (2/’) || i G N,i ^ t k j - • 

Reducing rj>n{y TK ) by the polynomials in M' K , we obtain a 
polynomial p' with deg^p') < a\ — 1. Since B k is an F- 
basis and B^- C Bk, p 1 0. So B^-U-jj/} is an echelon basis 
of im (4>k)- Consequently, W k is spanned by an echelon ba¬ 
sis {1, y,..., y d °Sy( p ')-\y^y(r')+\ . . . , y^+TK-1 } . 

The dimension of W k is equal to ai - 1. 


Example 4.6. Let K = (y 4 + 1 )/(y + l) 4 , which is shift- 
reduced. Then tk = 4. According to Case 5, im {4>k) has an 
echelon basis 

{<t>K (p)} U {(j>K ( y m ) I m G N, m ^ 4} , 

where p = y 4 + p/3 + 1/2, <Pk(p) = (5/3 )y 2 + 2 y + 4/3, 
and (j>K (y m ) = (m — 4)p m+3 + lower terms. Therefore, W k 
has a basis {1 ,y,y'}. 

From the above case distinction and example, one observes 
that, although the degree of a polynomial in the standard 
complement depends on tk, which may be arbitrarily high, 
the number of its terms depends merely on the degrees of u 
and v. We record this observation in the next proposition. 

Proposition 4.7. With the Convention 3.1, we further let 

ai = deg y (w), a 2 = deg y (v), and /3 = deg y (v - u). 

Then there exists V C { y 1 \ i G N} with 

[P\ < maxjai, a 2 } - \f3 < ai — 1] 

s.t. every polynomial in F[p] can be reduced modulo im (4>k) 
to an ¥-linear combination of the elements in V ■ Note that 
here |[/3 < a?i — 1] equals 1 if P < a i — 1, otherwise it is 0. 

Proof. By the above case distinction, dim(Wjf) is no more 
than max{ai,a 2 } — |[/3 < cri — lj. The lemma follows. □ 

The above case distinction enables one to find an infinite 
sequence po,pi,... in F[p] s.t. 

Ek = {(j> K (pi)\i G N} with deg y pK{Pi) < deg y p K {Pi+ 1 ), 

is an echelon basis of im ( <j>K )• This basis allows us to project 
a polynomial on im (</>k) and W k, respectively. In the first 
four cases, the pi’s can be chosen as powers of y. But in the 
last case, one of the pfs is not necessarily a monomial as 
shown in Example 4.6. 

PolynomialReduction: Given p G F[p], compute / G F[p] 
and q G W k s.t. p = />rsr(/) + q. 

1. If p = 0, then set f = 0 and q = 0; return. 

2. Set d = deg y (p). Find the subset P = {pi 1} ■ ■ ■ ,Pi s } 
consisting of the preimages of all polynomials in the 
echelon basis E k whose degrees are at most d. 

3. For k = s, s — 1,..., 1, perform linear elimination to 
findc s , c s — i, ..., ci G F s.t. p~J2k=l c k<t>ic{pi k ) G W/c- 

4. Set / = X)fc =i CfcPi,, and q = p - />ic(/); and return. 

We now present a modified version of the Abramov-Pet- 
kovsek reduction, which determines summability without 
solving any auxiliary difference equations explicitly. 

ModifiedAbramovPetkovsekReduction: Given an ir¬ 
rational hypergeometric term T over F(p), compute a 
hypergeometric term H with a kernel K, and two rational 
functions f,r G F(p) s.t. r is a residual form w.r.t. K, and 

T = A y (fH)+rH. (7) 


1. Find a kernel K and the corresponding shell 5 of T; 


2. Apply shell reduction to S w.r.t. K to find b,s,t G F[p] 
and g G F(y) s.t. b is shift-free and strongly prime 
with K; and 

T = Ay ( gH) + ( S - + H, (8) 

where o y (H)/H = K and v is the denominator of K. 

3. Set p and a to be the quotient and remainder of s 
and b, respectively. 

4. Apply polynomial reduction to vp + t to find h G F[j/] 
and q G W k s.t. vp + t = (f>K{h) + q. 

5. Set / := g+h and r := a/b+q/v and return H, f and r. 

Theorem 4.8. With Convention 3.1, the modified version 
of the Abramov-Petkovsek reduction computes a rational 
function f in ¥(y) and a residual form r w.r.t. K s.t. (7) 
holds. Moreover, T is summable if and only if r = 0. 


Proof. Recall that T = SH, where H has a kernel K and S 
is a rational function. Applying shell reduction to S w.r.t. K 
yields (8), which can be rewritten as 


T = Ay ( gH ) + 


fa vp + t \ 
\b v ) 


H, 


where a and p are given in step 3 of the modified Abramov- 
Petkovsek reduction. The polynomial reduction in step 4 
yields that vp + t = ua y (h) — vh + q. Substituting this 
into (8), we see that 

T = A y (gH) + (Ka y (h)-h)H+ (| + 2) H 
= A y ((g + h)H) + rH, 

where r = a/b+q/v. Thus, (7) holds. By Proposition 4.5, T 
is summable if and only r is equal to zero. □ 


Example 4.9. Let T be the same hypergeometric term as 
in Example S.f. Then K — y + 1 and S = y 2 /{y + 1). 
Set H = y\. By the shell reduction in Example 3.f, 


T = A y 



+ 



where v = 1. Applying the polynomial reduction to ( y/v)H 
yields ( y/v)H = Aj,(l ■ H). Combining the above steps, we 
decompose T as T = A y ( y/{y + 1 )H) — (l/(y + 2)) H. So 
the input term T is not summable. 


Example 4.10. Let T be the same hypergeometric term as 
in Example 3.5. Then K = y +1 and S = y. Set H = y\. By 
the shell reduction in Example 3.5, T = yH. The polynomial 
reduction yields yH = A y (p!) , hence T = A y (y\). 

Remark 4.11. With the notation given in the step 5 
of the modified version, we can rewrite rH as (si/s 2 )G, 
where si = av + bq, S 2 = b, and G = H/v. It follows from 
the case distinction at the beginning of this section that the 
degree of si is bounded by A given in [3, Theorem 8]. The 
polynomial S 2 is equal to b in (3) whose degree is minimal 
by [3, Theorem 3]. Moreover, a y (G)/G is shift-reduced, 
because cr y (H)/H is. These are exactly the same required 
properties of the output of the original version [3]. 


5. SUM OF TWO RESIDUAL FORMS 

To compute telescopers for bivariate hypergeometric terms 
by the modified Abramov-Petkovsek reduction, we are 
confronted with the difficulty that the sum of two residual 
forms is not necessarily a residual form. This is because the 
least common multiple of two shift-free polynomials is not 
necessarily shift-free. 

The goal of this section is to show that the sum of two 
residual forms is congruent to a residual form modulo Vk- 

Example 5.1. Let K=l/y, r=l/{2y+ 1) and s—1 / (2y + 3). 
Then both r and s are residual forms w.r.t. K, but their sum 
is not, because the denominator {2y+l){2y + 3) is not shift- 
free. However, we can still find an equivalent residual form. 
For example, we haver+s = — l/(2(2j/-|-l))-|-l/2p mod Vk- 
Note that the residual form is not unique. Another possible 
choice is r + s = l/(3(2 y + 3)) + 1/3 y mod Vk- 

Let / and g be two nonzero polynomials in F[y]. We say 
that / and g are shift-coprime if gcd (/, Cy(g)) = 1 for all 
nonzero integer i. Assume that both / and g are shift-free. 
By polynomial factorization and dispersion computation, 
one can uniquely decompose 

g=~9vt 1 (pT 1 )---"t k (P™ k ), (9) 

where g is shift-coprime with /, p\,...,pk are distinct, 
monic and irreducible factors of /, l\,... ,Ik are nonzero in¬ 
tegers, mi,. .., mi, are multiplicities of (pi), ..., <j y k ( pk) 
in g, respectively. We refer to (9) as the shift-coprime 
decomposition of g w.r.t. /. 

Remark 5.2. The factors g,o^ (p^ 1 ), ..., cr y k (p/f k ) 
in (9) are pairwise coprime, since f and g are shift-free. 

To construct a residual form congruent to the sum of two 
given residual ones, we need three technical lemmas. The 
first one corresponds to the kernel reduction in [8]. 


Lemma 5.3. With Convention 3.1, assume that pi,p 2 are 
in F[j/] and m in N. Then there exist qi, q 2 in W k s.t. 


Pi 


= — mod Vk and 




P2 


n; 


3=1 U V 


'(«) 


— mod Vk. 
v 


Proof. To prove the first congruence, let w m = nr=o^o- 
We proceed by induction on m. If m = 0, then the 
conclusion holds by Lemma 4.2. Assume that the lemma 
holds for m — 1. Consider the equality 


Pi 


= Ka v 




S t 

- 1- 1 

UJm — 1 U/m — 1 


where s, t G F[y] are to be determined. This equality holds 
if and only if a y (s)u + (t — s)a™(v) = pi. Since u and o™(v) 
are coprime, such s and t can be computed by the extended 
Euclidean algorithm. Thus, pi/wm = t/w m -i mod Vk. 
Consequently, pi/wm has a required residual form by the 
induction hypothesis. 

To prove the second congruence, we use the identity 


^ =K0 y (- P2 


HU) 


\u) 


P2 


>) 


Oy (P2 


which implies that P2/o~ y ( u ) = a y (P2) /v mod Vk- By 
Lemma 4.2, there exists <72 G Wk s.t. q 2 /v is a residual 
form of p 2 /o y 1 (u) w.r.t. K. Assume that the congruence 
holds for m — 1. The induction can be completed as in the 
proof for pi /wm ■ □ 














The next lemma provides us with flexibility to rewrite a 
rational function modulo Vk- 

Lemma 5.4. Let K £ F(y) be nonzero and shift-reduced. 
Then, for every f G F(y) and every £ G iff, 

t -1 t 

f = "1(f) n (T v( k ) = n a y‘ 

i=0 i=l 

Proof. Let us show the first congruence by induction on l. 
For 1 = 1, the identity / = Ko y (—f) — (—/) + a y (f)K 
implies that / is congruent to cr y (f)K modulo Vk- Assume 
that it holds for £ — 1. Set we = fIi=o a y(K)- Then / is 
congruent to o y ~ 1 (f)we- 1 modulo Vk by the induction hy¬ 
pothesis. Moreover, (f)we-i is congruent to o y (f)wi by 
the induction base, in which / is replaced with o y ~ 1 (f)wi- 1 . 
Hence, / is congruent to o y (f)wt modulo Vk. 

The second congruence can be shown similarly. For 1=1, 
the identity / = Ko y (r) — r + r with r = o y 1 (f)o y 1 (l/K) 
implies that / is congruent to r modulo Vk- We can then 
proceed as in the proof of the first congruence. □ 


— ) mod Vk- 


and partial fraction decomposition, we have 


b 

9 


bo \ ^ bi 

J + 2^pr!H’ 

y i= 1 1 


( 10 ) 


where &o, b\,..., bk G F[j/]. Note that pi = o y li (Pi), which 
is a factor of /. Thus it is strongly prime with K. So we 
can apply Lemma 5.5 to each fraction bi/P™ 1 in (10) to get 


b 

9 


^ + £4 + ^ m odv„ 

n ' n. 1 v 


(ii) 


where b[,..., b' k G F[y] and q' G W k- 

Set h = sntiPr- Then h is shift-free and strongly 
prime with K as both / and g are. Since / is shift-free, 
all its factors are shift-coprime with /, so are the pfs, and 
so is h. Let t be the sum of q/v and the rational function 
in the right-hand side of (11). Then there exist b * in F[y] 
with deg y (&*)< deg y (h) and q* in W k s.t. t=b*/h+q*/v. 
Since / and h are shift-coprime, their least common multiple 
is shift-free. Therefore, A r + fit is a residual form w.r.t. K, 
and A r + gt is congruent to A r + gs mod Vk- □ 


Lemma 5.5. With Convention 3.1, let a,b £ F[j/] with 
b^O. Assume that b is shift-free and strongly prime with I\ . 
Assume further that a y (b) is strongly prime with K for some 
integer £, then a/b has a residual form c/o y (b)+q/v w.r.t. K, 
where c G F[y] with deg y (c) < deg y (6) and q G W k- 


Proof. First, consider the case in which £ > 0. If £ = 0, then 
there exist c,p G F with deg y (c) < deg y (6) s.t. a/b = c/b+p. 
The lemma follows from Remark 4.3. 

Assume that £ > 0. By the first congruence of Lemma 5.4, 


a 

b 




r£») 


mod Vk- 


Note that cr y (b) is strongly prime with v by assumption. 
Then it is coprime with the product va y (v) ■ ■ ■ CT y _1 (u). By 
partial fraction decomposition, we get 


a 

b 


+ 




mod Vk- 


By the first congruence of Lemma 5.3, the second summand 
in the right-hand side of the above congruence can be 
replaced by a residual form whose denominator is equal to v. 
The first assertion holds. 

The case in which £ < 0 can be handled in the same way, 
in which the second congruences of Lemmas 5.4 and 5.3 will 
be used instead of the first ones in these lemmas. □ 


We are ready to present the main result of this section. 

Theorem 5.6. With Convection 3.1, let r and s be two 
residual forms w.r.t. K. Then there exists a residual form t 
congruent to s modulo Vk s.t., for all A, g £ F, Ar + gt is a 
residual form w.r.t. K congruent to Ar + gs modulo Vk- 

Proof. Let r = a/ f+p/v and s = b/g+q/v, where a, f,b,g G 
F [y], deg y (o) < deg(/), deg y (b) < Aeg y (g), p,q G W K , 
and /, g are shift-free and strongly prime with K. 

Assume that (9) is the shift-coprime decomposition of g 
w.r.t. /. Set Pi = a y '(pi) for i = 1, ..., k. By Remark 5.2 


6. TELESCOPING VIA REDUCTIONS 

Let C be a field of characteristic zero, and C(x,y) be the 
field of rational functions in x and y over C. Let a x ,a y be 
the shift operators w.r.t. x and y, respectively, defined by, 

°x(f(x,y)) = f(x + l,y) and a v (f(x,y)) = f(x,y + 1), 

for any / G C(x,y). Then the pair (C(x, y), {o x , a y }) forms 
a partial difference field. 

Definition 6.1. Let D be a partial difference ring extension 
of C(x,y). A nonzero element T £ D is called a hyper¬ 
geometric term over C(x,y) if there exist f,g G C(x,y) 
s.t. o x (T) = fT and a y (T) = gT. We call f,g the shift 
quotients ofT w.r.t. x andy, respectively. 

An irreducible polynomial p G C\x, y\ is said to be integer- 
linear over C if there exist / G C[z], m, n G Z with n > 0 
and gcd(m,n) = 1, s.t. p = f(mx + ny). A polynomial 
in C[x,y\ is said to be integer-linear over C if all of its 
irreducible factors are integer-linear. A rational function 
in C(x, y) is said to be integer-linear over C if its denominator 
and numerator are integer-linear. 

Let F be the field C(x), and F (S x ) be the ring of linear 
recurrence operators in x, in which the commutation rule 
is that S x r = a x (r)S x for all r G F. The application of 
an operator L = X)f = o to a hypergeometric term T is 
defined as L(T) = J2i=o^ ia ^( T )- 

Definition 6.2. Let T be a hypergeometric term over¥(y). 
A nonzero operator L G F (S x ) is called a telescoper for T 
if there exists a hypergeometric term G s.t. L(T) = A y (G). 
We call G the certificate of L. 

For hypergeometric terms, telescopers do not always exist. 
Abramov presented a criterion for determining the existence 
of telescopers in [2, Theorem 10]. Let K=u/v be a kernel 
of <j y (T)/T and S the corresponding shell. Applying the 
modified Abramov-Petkovsek reduction w.r.t. y to T yields 
T = A y (uH) + rH, where u G F(y), H = T/S, and 
r = a/b + q/v is the residual form of S w.r.t. K. By 
Abramov’s criterion, T has a telescoper if and only if b is 





integer-linear over C. When telescopers exist, Zeilberger’s 
algorithm [21] constructs a telescoper for T by iteratively 
using the Gosper algorithm to detect the summability 
of L(T) for an ansatz L = Yhi=o^i^x £ V(S X ). 

Following the creative telescoping algorithms based on 
Hermite reductions [7, 10, 9, 8] in the continuous case, we 
use the modified Abramov-Petkovsek reduction to develop 
a telescoping algorithm, which is outlined below. 

ReductionCT: Given a hypergeometric term T with shift 
quotients / = a x (T)/T and g = a y (T)/T in F(y), compute 
a telescoper of minimal order for T and its certificate if 
telescopers exist. 

1. Find a kernel K and shell S of T w.r.t. y s.t. T = SH 
with K = <x y (H)/H. 

2. Apply the modified Abramov-Petkovsek reduction to T 
to get 

T=A y {u 0 H) + r 0 H. (12) 

If ro = 0, then return (1 ,mo H). 

3. If the denominator of ro is not integer-linear, return 
“No telescoper exists!”. 

4. Set N : = a x (H)/H and R := £o?’o, where £o is an 
indeterminate. 

For i = 1,2,..., do 

4.1. View cr x (ri-i)NH as a hypergeometric term with 
kernel K and shell a x (ri-i)N. Using shell reduc¬ 
tion w.r.t. K and polynomial reduction w.r.t. K, 
find u[ G F and a residual form f; w.r.t. K 
s.t. cr x (n-i)NH = A y (u'iH) + hH. 

4.2. Set ui = a x (ui-i)N + u[, so that 

(J X {T) = A y (iiiH) + fiH. (13) 

4.3. Follow the proof of Theorem 5.6 to compute Ui 
and ri in F(y) s.t. ri = fj mod Yk, 

a i x (T) = A y (u i H) + r i H, (14) 

and that R + hri is a residual form w.r.t. I \, 
where li is an indeterminate. 

4.4. Update R to be R + iin. 

4.5. Find ij G F s.t. R = 0 by solving a linear 
system in £o , ■ ■ ■ ,£i over F. If there is a nontrivial 
solution, return {^2) =0 £jS 3 x , 

Theorem 6.3. Let T be a hypergeometric term over¥(y). 
If T has a telescoper, then the algorithm ReductionCT 
terminates and returns a telescoper of minimal order for T. 

Proof. By Theorem 4.8, ro = 0 implies that 1 is a telescoper 
for T of minimal order. 

Let ro obtained from step 2 be of the form ao/bo + qo/v, 
where a 0 ,b 0 ,v G F[y], deg y (a 0 ) < deg y (6 0 ), b 0 is strongly 
prime with K, qo G Wk, and v is the denominator of K. 
By Ore-Sato’s theorem [18, 20] on hypergeometric terms, K 
is integer-linear and so is v. It follows that bo is integer- 
linear if and only if bov is. By Abramov’s criterion, T has 
a telescoper if and only if the denominator of ro is integer- 
linear. Thus, steps 2 and 3 are correct. 

It follows from (12) and cr x (roH) = a x (ro)NH that (13) 
holds for i = l. By Theorem 5.6, there exists a residual 
form n w.r.t. K with n = ri mod Yk s.t. R + ^iri is again 


a residual form for all lo,li G F. Indeed, the proofs of the 
lemmas and Theorem 5.6 enable us to obtain not only n 
but also a rational function gi s.t. fi = Ka y (g\) — g i + n. 
Setting Mi = Mi + gi, we see that (14) holds for is 1. By a 
direct induction on i, (14) holds in the loop of step 4. 

Assume that L = Yli=o Ci ^ a telescoper of minimal 
order for T with a G F and c p ^ 0. Then L(T) is 
summable. By Theorem 4.8, c * r * i s equal to zero. 

Thus, the linear homogeneous system (over F) obtained by 
equating 5Zf=o ^ iri zero ^ as a nontrivial solution, which 
yields a telescoper of minimal order. □ 

Remark 6.4. The algorithm ReductionCT separates the 
computation of minimal telescopers from that of certificates. 
In applications where the certificates are irrelevant, we can 
drop step 4-2 and in step f.3 compute Ui and n with n = 
fi mod Vk and a x (n-i)NH = A y (uiH)+nH and that R+ 
liri is a residual form w.r.t. K, where li is an indeterminate. 
The rational function u ; can be discarded , and we do not 
need to calculate Yl]=o^i u i^ erw ^- 

Remark 6.5. Instead of applying the modified Abramov- 
Petkovsek reduction to a x (n~i)NH in step 4-l> it 
possible to apply the reduction to a x (T), but our experiments 
suggest that this variant takes considerably more time. 

Example 6.6. Let T = ( y ) 3 - Set f and g to be a x (T)/T 
and a y (T)/T, respectively. Since g is shift-reduced w.r.t. y, 
its kernel is equal to g itself, and the corresponding shell is 
equal to 1. In step 4, we obtain a y (T) = ( qi/v)H mod U h, 
where i= 0,1, 2, v = (y + l) 3 , H has shift quotient g w.r.t. y. 

qo = ^(x+l)(x 2 —x+2,y(y— a:+l)+l), qi = (x + l) 3 , and 

q 2 = + (llx 2 - 12 xy + 17m + 20 + 12y + 12y 2 ) . 

(x + 

By finding an F -linear dependency among qo,qi,q 2 , we get 
L := (x + 2) 2 S' 2 - (7m 2 + 21m + 16)5 B - 8(m + l) 2 
is a telescoper of minimal order for T. 

7. IMPLEMENTATION AND TIMINGS 

We have implemented our algorithms in Maple. In order 
to get an idea about their efficiency, we compared their 
runtime and memory requirements to the performance of 
known algorithms. All timings are measured in seconds on 
a Linux computer with 388Gb RAM and twelve 2.80GHz 
Dual core processors. 

For the first comparison, we considered univariate hyper¬ 
geometric terms of the form 

T = f(y ) r (y ~ «) 

gi(y) 92 (y) r {y-PY 

where / G 7L[y\ of degree 20, gi = pia y (pi)a^(pi) with pi G 
7L[y\ of degree 10, 1 ,/iGN, and a, j3 G TL. For a selection of 
random terms of this type for different choices of p and A, 
Table 1 compares the timings of Maple’s implementation 
of the classical Abramov-Petkovsek reduction (AP) and our 
modified version (MAP). We apply the algorithms to T as 
well as to the summable terms a y (T) — T. 

For the second comparison, we considered bivariate hy¬ 
pergeometric terms of the form 

T = f{x,y) T(2ax + y) 
gi{x + y)g 2 { 2 x + y) F(x + ay) 






T 

AP 

MAP 

*v(T) 

AP 

— T 
MAP 

(0,0) 

0.45 

0.30 

4.41 

2.29 

(5,5) 

5.94 

1.21 

10.19 

2.40 

(5,10) 

14.69 

2.20 

15.67 

3.30 

(10,10) 

17.22 

2.31 

17.98 

2.77 

(10,20) 

57.05 

6.20 

38.03 

3.80 

(10,30) 

316.51 

15.73 

74.55 

3.73 

(10,40) 

514.84 

32.64 

134.29 

3.99 


Table 1: Comparison of the Abramov-Petkovsek 

reduction and the modified version for a collection of 
non-summable terms T and summable terms o- y (T) — T. 

with / G %[x,y] of degree n, gi = Pio^(pi)of(pi) with pi G 
Z[z] of degree m, and a,A,/i£ N. For a selection of random 
terms of this type for different choices of n, m, a, p, X, 
Table 2 compares the timings of Maple’s implementation of 
Zeilberger’s algorithm (Z) and two variants of the algorithm 
ReductionCT from Section 6: For the column RCTi we 
computed both the telescoper and certificate, and for RCT 2 
we only compute the telescoper. The difference between 
these two variants comes mainly from the time needed to 
bring the rational function u in the certificate uH on a 
common denominator. When it is acceptable to keep the 
certificate as an unnormalized linear combination of rational 
functions, the timings are virtually the same as for RCT 2 . 


( m, n, a, X, p) 

Z 

RCTi 

RCT 2 

order 

(L0,1,5, 5) 

17.12 

5.00 

1.80 

4 

(1,0, 2, 5, 5) 

74.91 

26.18 

5.87 

6 

(1,0, 3, 5, 5) 

445.41 

92.74 

17.34 

7 

(1,8, 3, 5, 5) 

649.57 

120.88 

23.59 

7 

(2,0,1,5,10) 

354.46 

58.01 

4.93 

4 

(2,0,2,5,10) 

576.31 

363.25 

53.15 

6 

(2,0,3,5,10) 

2989.18 

1076.50 

197.75 

7 

(2,3,3,5,10) 

3074.08 

1119.26 

223.41 

7 

(2,0,1,10,15) 

2148.10 

245.07 

11.22 

4 

(2,0,2,10,15) 

2036.96 

1153.38 

153.21 

6 

(2,0,3,10,15) 

11240.90 

3932.26 

881.12 

7 

(2,5,3,10,15) 

10163.30 

3954.47 

990.60 

7 

(3,0,1,5,10) 

18946.80 

407.06 

43.01 

6 

(3,0,2,5,10) 

46681.30 

2040.21 

465.88 

8 

(3,0,3,5,10) 

172939.00 

5970.10 

1949.71 

9 


Table 2: Comparison of Zeilberger’s algorithm to 
reduction-based telescoping with and without construc¬ 
tion of a certificate 
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